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213 <211> LENGTH: 2869 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Acidothermus cellulolyticus 

217 <220> FEATURE: 

218 <221> NAME/KEY: modif ied_base 

219 <222> LOCATION: (2869) 

220 <223> OTHER INFORMATION: a, c, t, g, other or unknown 

222 <400> SEQUENCE: 2 

223 atggatcgtt cggagaacat ccgtctgact atgagatcac gacgattggt atcactgctc 6,0 

224 gccgccactg cgtcgttcgc cgtggccgcc gctctgggag ttctgcccat cgcgataacg 120 

225 gcttctcctg cgcacgcggc gacgactcag ccgtacacct ggagcaacgt ggcgatcggg 180 

226 ggcggcggct ttgtcgacgg gatcgtcttc aatgaaggtg caccgggaat tctgtacgtg 240 

227 cggacggaca tcggggggat gtatcgatgg gatgccgcca acgggcggtg gatccctctt 300 

228 ctggattggg tgggatggaa caattggggg tacaacggcg tcgtcagcat tgcggcagac 360 

229 ccgatcaata ctaacaaggt atgggccgcc gtcggaatgt acaccaacag ctgggaccca 420 
2 30 aacgacggag cgattctccg ctcgtctgat cagggcgcaa cgtggcaaat aacgcccctg 4 80 
231 ccgttcaagc ttggcggcaa catgcccggg cgtggaatgg gcgagcggct tgcggtggat 540 
2 32 ccaaacaatg acaacattct gtatttcggc gccccgagcg gcaaagggct ctggagaagc 6 00 

233 acagattccg gcgcgacctg gtcccagatg acgaactttc cggacgtagg cacgtacatt 660 

234 gcaaatccca ctgacacgac cggctatcag agcgatattc aaggcgtcgt ctgggtcgct 720 

235 ttcgacaagt cttcgtcatc gctcgggcaa gcgagtaaga ccatttttgt gggcgtggcg 780 

236 gatcccaata atccggtctt ctggagcaga gacggcggcg cgacgtggca ggcggtgccg 840 

237 ggtgcgccga ccggcttcat cccgcacaag ggcgtctttg acccggtcaa ccacgtgctc 900 

238 tatattgcca ccagcaatac gggtggtccg tatgacggga gctccggcga cgtctggaaa 960 

239 ttctcggtga cctccgggac atggacgcga atcagcccgg taccttcgac ggacacggcc 1020 

240 aacgactact ttggttacag cggcctcact atcgaccgcc agcacccgaa cacgataatg 1080 

241 gtggcaaccc agatatcgtg gtggccggac accataatct ttcggagcac cgacggcggt 1140 

242 gcgacgtgga cgcggatctg ggattggacg agttatccca atcgaagctt gcgatatgtg 1200 

243 cttgacattt cggcggagcc ttggctgacc ttcggcgtac agccgaatcc tcccgtaccc 1260 

244 agtccgaagc tcggctggat ggatgaagcg atggcaatcg atccgttcaa ctctgatcgg 1320 

245 atgctctacg gaacaggcgc gacgttgtac gcaacaaatg atctcacgaa gtgggactcc 1380 

246 ggcggccaga ttcatatcgc gccgatggtc aaaggattgg aggagacggc ggtaaacgat 1440 

247 ctcatcagcc cgccgtctgg cgccccgctc atcagcgctc tcggagacct cggcggcttc 1500 

248 acccacgccg acgttactgc cgtgccatcg acgatcttca cgtcaccggt gttcacgacc 1560 

249 ggcaccagcg tcgactatgc ggaattgaat ccgtcgatca tcgttcgcgc tggaagtttc 1620 

250 gatccatcga gccaaccgaa cgacaggcac gtcgcgttct cgacagacgg cggcaagaac 1680 

251 tggttccaag gcagcgaacc tggcggggtg acgacgggcg gcaccgtcgc cgcatcggcc 1740 

252 gacggctctc gtttcgtctg ggctcccggc gatcccggtc agcctgtggt gtacgcagtc 1800 

253 ggatttggca actcctgggc tgcttcgcaa ggtgttcccg ccaatgccca gatccgctca 1860 

254 gaccgggtga atccaaagac tttctatgcc ctatccaatg gaaccttcta tcgaagcacg 1920 

255 gacggcggcg tgacattcca accggtcgcg gccggtcttc cgagcagcgg tgccgtcggt 1980 

256 gtcatgttcc acgcggtgcc tggaaaagaa ggcgatctgt ggctcgctgc atcgagcggg 2040 

257 ctttaccact caaccaatgg cggcagcagt tggtctgcaa tcaccggcgt atcctccgcg 2100 

258 gtgaacgtgg gatttggtaa gtctgcgccc gggtcgtcat acccagccgt ctttgtcgtc 2160 
2 59 ggcacgatcg gaggcgttac gggggcgtac cgctccgacg actgtgggac gacctgggta 222 0 

260 ctgatcaatg atgaccagca ccaatacgga aattggggac aagcaatcac cggtgaccac 2280 

261 gcgaatttac ggcgggtgta cataggcacg aacggccgtg gaattgtata cggggacatt 2340 

262 ggtggtgcgc cgtccggatc gccgtctccg tcggtgagtc cgtcggcttc gccgagcctg 2400 

263 agcccgagcc cgagcccgag cagctcgcca tcgccgtcgc cgtcgccgag ctcgagtcca 2460 
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264 tcctcgtcgc cgtctccgtc gccgtcacca tcgccgagtc cgtctcggtc tccgtcacca 2520 

265 tcggcgtcgc cgagcccgtc ttcgtcaccg agcccgtctt cgtcaccgtc ttcgtcgccg 2580 

266 agcccaacgc cgtcgtcgtc gccggtgtcg ggtggggtga aggtgcagta taagaataat 2640 

267 gattcggcgc cgggtgataa tcagatcaag ccgggtttgc aggtggtgaa taccgggtcg 2700 

268 tcgtcggtgg atttgtcgac ggtgacggtg cggtactggt tcacccggga tggtggctcg 2760 
, 269 tcgacactgg tgtacaactg tgactgggcg gcgatcgggt gtgggaatat ccgcgcctcg 2820 
> 2 70 ttcggctcgg tgaacccggc gacgccgacg gcggacacct acctgcagn 2869 

273 <210> SEQ ID NO : 3 

274 <211> LENGTH: 740 

275 <212> TYPE: PRT 

276 <213> ORGANISM: Acidothermus cellulolyticus 

278 <220> FEATURE: 

279 <223> OTHER INFORMATION: Catalytic domain GH74 
281 <400> SEQUENCE: 3 
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Tyr 


val 


Arg 
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Gly 
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Met 


Tyr 


Arg 


Trp 


Asp 
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Aia 


Asn 


O O Q 
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2 yi 


Gly 


Arg 


Trp 


lie 


Pro 


Leu 


Leu 


Asp 


Trp 


vai 
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Trp 


Asn 


Asn 


Trp 


ijiy 


TOO 

2 y z 














D D 
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Tyr 
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Gly 


Val 
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Ala 


Asp 


Pro 


lie 


Asn 


Thr 


Asn 


Lys 
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297 


Val 


Trp 


Ala 


Ala 


Val 


Gly 


Met 


Tyr 


Thr 


Asn 


Ser 


Trp 


Asp 


Pro 


Asn 


Asp 


298 










85 










90 










95 




300 


Gly 


Ala 


He 


Leu 


Arg 


Ser 


Ser 


Asp 


Gin 


Gly 


Ala 


Thr 


Trp 


Gin 


He 


Thr 


301 








100 










105 










110 






303 


Pro 


Leu 


Pro 


Phe 


Lys 


Leu 


Gly 


Gly 


Asn 


Met 


Pro 


Gly 


Arg 


Gly 


Met 


Gly 


304 






115 










120 










125 








306 


Glu 


Arg 


Leu 


Ala 


Val 


Asp 


Pro 


Asn 


Asn 


Asp 


Asn 


He 


Leu 


Tyr 


Phe 


Gly 


307 




130 










135 










140 










309 


Ala 


Pro 


Ser 


Gly 


Lys 


Gly 


Leu 


Trp 


Arg 


Ser 


Thr 


Asp 


Ser 


Gly 


Ala 


Thr 


310 


145 










150 










155 










160 


312 


Trp 


Ser 


Gin 


Met 


Thr 


Asn 


Phe 


Pro 


Asp 


Val 


Gly 


Thr 


Tyr 


He 


Ala 


Asn 


313 










165 
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315 


Pro 


Thr 


Asp 


Thr 


Thr 


Gly 


Tyr 


Gin 


Ser 


Asp 


He 


Gin 


Gly 


Val 


Val 


Trp 
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180 
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318 


Val 


Ala 


Phe 


Asp 


Lys 


Ser 


Ser 


Ser 


Ser 


Leu 


Gly 


Gin 


Ala 


Ser 


Lys 


Thr 


319 






195 










200 










205 








321 


He 


Phe 


Val 


Gly 


Val 


Ala 


Asp 


Pro 


Asn 


Asn 


Pro 


Val 


Phe 


Trp 


Ser 


Arg 


322 
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220 










324 


Asp 


Gly 


Gly 


Ala 


Thr 


Trp 


Gin 


Ala 


Val 


Pro 


Gly 


Ala 


Pro 


Thr 


Gly 


Phe 


325 


225 










230 










235 










240 


327 


He 


Pro 


His 


Lys 


Gly 


Val 


Phe 


Asp 


Pro 


Val 


Asn 


His 


Val 


Leu 


Tyr 


He 


328 










245 










250 










255 




330 


Ala 


Thr 


Ser 


Asn 


Thr 


Gly 


Gly 


Pro 


Tyr 


Asp 


Gly 


Ser 


Ser 


Gly 


Asp 


Val 


331 








260 










265 










270 






333 


Trp 


Lys 


Phe 


Ser 


Val 


Thr 


Ser 


Gly 


Thr 


Trp 


Thr 


Arg 


He 


Ser 


Pro 


Val 



Us© of n and/or Xaa has been detected ifi tfi© §egii§n?9 IMflP: 
R? v l'§W the Sequence Listing to insure a corresponding ' v 
SSglafiation is presented in the <220> to <223> fields of 
Sach sequence using n or Xaa. 
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VERIFICATION SUMMARY 

^PATENT APPLICATION: US/09/917,376 



DATE: 01/29/2002 
TIME: 12:18:04 



Input Set : A:\401974us.app 

Output Set: N:\CRF3\01292002\I917376.raw 



L:208 M:341 W 
L:270 M:341 W 
L:453 M:341 W 



(46) n n n or n Xaa n used, for SEQ ID#:1 
(46) "n" or "Xaa" used, for SEQ ID#:2 
(46) "n" or "Xaa" used, for SEQ ID# : 4 
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